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RAW SEQUENCE LISTING DATE: 01/25/2002 

PATENT APPLICATION: US/09/82 2,2 9 5 TIME: 16:51:27 

incut Set : N:\Crf3\RULE60\09822295.raw 
Output Set: N:\CRF3\01252002\l822295.raw 



SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 



5 (i) APPLICANT: Bahija Jallal 

6 Gregory D. Plowman 

9 (11) TITLE OF INVENTION: DIAGNOSIS AND TREATMENT OF 

10 PTP04 RELATED DISORDERS 

12 (Hi) NUMBER OF SEQUENCES: 18 

15 (iv) CORRESPONDENCE ADDRESS: 

17 (A) ADDRESSEE: Lyon & Lyon 

18 (B) STREET: 633 West Fifth Street 

19 Suite 4700 

20 (C) CITY: Los Angeles |^ i\ ; ^ 

21 (D) STATE: California 5**, £ '%'. [ $**«r IT** !t 

22 ( E ) COUNTRY : U.S.A. ' -** fj f 

23 (F) ZIP : 90071-2066 
2 6 (v) COMPUTER READABLE FORM: 

2 8 (A) MEDIUM TYPE: 3.5" Diskette.. 1 = 44 Mb 

29 storage 

30 (B) COMPUTER: IBM Compatible 

31 (C) OPERATING SYSTEM: IBM P.C. DOS 5.0 

32 (D) SOFTWARE: FastSEQ for Windows 2.0 
35 (vi) CURRENT APPLICATION DATA: 

c __> 37 (A) APPLICATION NUMBER: US/09/822,295 

c __> 38 (B) FILING DATE: 02-Apr-2001 

39 (C) CLASSIFICATION: 

4 2 (vii) PRIOR APPLICATION DATA: 

44 (A) APPLICATION NUMBER: 09/081,345 

45 (B) FILING DATE: 
4 8 (viii) ATTORNEY/AGENT INFORMATION: 
5t] (A) NAME: Warburg, Richard J. 



(B) REGISTRATION NUMBER : 32,327 

(C) REFERENCE/DOCKET NUMBER: 2 3 4/253 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE : (213) 489-1600 

(B) TELEFAX: (213) 955-0440 



(C) TELEX: 67-3510 

64 (2) INFORMATION FOR SEQ ID NO: 1: 

66 (i) SEQUENCE CHARACTERISTICS: 

68 (A) LENGTH: 3580 base pairs 

69 (B) TYPE: nucleic acid 

7 0 (C) STRANDEDNESS : single 

7i ( D ) TOPOLOGY: linear 

7 3 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/8 2 2 , 2 9 5 



DATE : 0 1/:; 5/20 02 
TIME • j >> '> 1 : 2 / 



75 
7 6 
77 
78 
79 
80 
81 
82 
83 
84 



86 
87 
88 
89 
90 
91 
92 
93 
94 
95 
96 
97 
98 
99 
10 0 
1 0.1 
102 
103 
104 
105 
106 
107 
108 
109 
110 
111 
112 
113 
1 14 

115 

116 
117 
118 
llv 
12<) 
121 
122 
123 



Input Set : N:\Crf3\RULE60\09822295.raw 
Output Set: N:\CRF3\01252002\l822295.raw 

f-pfVGTGCC CTCCCTCAAC CTACTTATAG ACTATTTTTC TTGCTCTGCA GCATGGACCA 
AAGAGAAATT CTGGAGAAGT TCCTGGATGA GGCCCAAAGC AAGAAA ATTA OTAAAGAGGA 
GTT TGCCAAT GAATTTCTGA AGCTGAAAAG GCAATCTACC AAGTACAAGG OAGACAAAAC 
CTATCCTACA ACTGTGGCTG AGAAGCCCAA GAATATCAAG AAAAACAGAT ATAAGGATAT 
TTTGCCCTAT GATTATAGCC GGGTAGAACT ATCCCTGATA ACCTCTGATG AGGAT rCCAG 
TTACATCAAT GCCAACTTCA TTAAGGGAGT TTATGGACCC AAGGCTTATA TTGCCAGCCA 
GGGTCCTTTA TCTACAACCC TCCTGGACTT CTGGAGGATG ATTTGGGAAT ATAGTGTCCT 
IATCATTGTT ATGGCATGCA TGGAGTATGA AATGGGAAAG AAAAAGTGTG AGCGClAClG 
GG^TGAGCCA GGAGAGATGC AGCTGGAATT TGGCCCTTTC TCTGTATCCT GTGAAGCTGA 
AAAAAGGAAA TCTGATTATA TAATCAGGAC TCTAAAAGTT AAGTTCAATA GTGAAACTCG 
AAC TATCTAC CAGTTTCATT ACAAGAATTG GCCAGACCAT GATGTACCTT CATCTATAGA 
rrrTATTPTT GAG^TCATCT GGGATGTACG TTGTTACCAA GAGGATGACA GTGTTCCCAT 
ATGCATTCAC TGCAGTGCIG GCTGTGGAAG GACTGGTGTT ATTTGTGCTA TTGATTATAC 



ATGGATGTTG C TAAAAGATG GGATAATTCC TGAGAACTTC AGTGTTTTCA GTTTGATCCG 
GGAAATGCGG ACACAGAGGC CTTCATTAGT TCAAACGCAG GAACAATATG AACTGGTCTA 
CAATGCTGTA TTAGAACTAT TTAAGAGACA GATGGATGTT ATCAGAGATA AAC ATT CTGG 
AACAGAGAGT CAAGCAAAGC ATTGTATTCC TGAGAAAAAT CACACTCTCC AAGCAGACTC 
TTATTCTCCT AATTTACCAA AAAGTACCAC AAAAGCAGCA AAAATGATGA ACCAACAAAG 
GACAAAAATG GAAATCAAAG AATCTTCTTC CTTTGACTTT AGGACTTCTG AAATAAGTGC 
AAAAGAAGAG CTAGTTTTGC ACCCTGCTAA ATCAAGCACT TCTTTTGACT TTCTGGAGCT 
AAATTACAGT TTTGACAAAA ATGCTGACAC AACCATGAAA TGGCAGACAA AGGCATTTCC 
AATAGTTGGG GAGCCTCTTC AGAAGCATCA AAGTTTGGAT TTGGGCTCTC TTTTGTTTGA 
GGGATGTTCT AATTCTAAAC CTGTAAATGC AGCAGGAAGA TATTTTAATT CAAAGGTGCC 
AATAACACGG ACCAAATCAA CTCCTTTTGA ATTGATACAG CAGAGAGAAA CCAAGGAGGT 
GGACAGCAAG GAAAACTTTT CTTATTTGGA ATCTCAACCA CATGATTCTT GT TTTGTAGA 
i GATGCAGGCT CAAAAAGTAA TGCATGTTTC TTCAGCAGAA CTGAATTATT CACTGCCATA 
TGACTCTAAA CACCAAATAC GTAATGCCTC TAATGTAAAG CACCATGACT CTAGTGCTCT 
TGGTGTATAT TCTTACATAC CTTTAGTGGA AAATCCTTAT TTTTCATCAT GGCCTCCAAG 
TGGTACCAGT TCTAAGATGT CTCTTGATTT ACCTGAGAAG CAAGATGGAA CTGTTTTTCC 
TTCTTCTCTG TTGCCAACAT CCTCTACATC CCTCTTCTCT TATTACAATT CACATGATTC 
TTTATCACTG AATTCTCCAA CCAATATTTC CTCACTATTG AACCAGGAGT CAGCTGTACT 
AGCAACTGCT CCAAGGATAG ATGATGAAAT CCCCCCTCCA CTTCCTGTAC GGACACCTGA 
ATCATTTATT GTGGTTGAGG AAGCTGGAGA ATTCTCACCA AATGTTCCCA AATCCTTATC 
CTCAGCTGTG AAGGTAAAAA TTGGAACATC ACTGGAATGG GGTGGAACAT CTGAACCAAA 
GAAATTTGAT GACTCTGTGA TACTTAGACC AAGCAAGAGT GTAAAACTCC GAAGTCCTAA 
ATCAGAACTA CATCAAGATC GTTCTTCTCC CCCACCTCCT CTCCCAGAAA GAACTC TAGA 
GTCCTTCTTT CTTGCCGATG AAGATTGTAT GCAGGCCCAA TCTATAGAAA CATATTCTAC 
TAGCTATCCT GACACCATGG AAAATTCAAC ATCTTCAAAA CAGACACTGA AGACTCCTGG 
AAAAAGTTTC ACAAGGAGTA AGAGTTTGAA AATTTTGCGA AACATGAAAA AGAGTAICTG 
TAATTCTTGC CCACCAAACA AGCCTGCAGA ATCTGTTCAG TCAAATAACT CCAGCTCATT 
TCTGAATTTT GGTTTTGCAA ACCGTTTTTC AAAACCCAAA GGACCAAGGA ATCCACCACC 
AACTTGGAAT ATTTAATAAA ACTCCAGATT TATAATAATA TGGGCTGCAA GTACACCTGC 
AAATAAAACT ACTAGAATAC TGCTAGTTAA AATAAGTGCT CTATATGCAT AATATCAAAT 
ATGAAGATAT GCTAATGTGT TAATAGCTTT TAAAAGAAAA GCAAAAIGCC AATAAG'TGCC 
AGT ITTGCAT TTTCATATCA TTTGCA TTGA GTTGAAAACT GCAAATAAAA GTTTGTCACT 
TGAGCTTATG TACAGAATGC TATATGAGAA ACACTTTTAG AATGGATTTA TTTTTCATTT 
TTGCCAGTTA TTTTTATTTT CTTTTACTTT TTTACATAAA CATAAACTTC AAAAGGTTTG 
TAAGATTTGG ATCTCAACTA ATTTCTACAT TGCCAGAATA T A C T A T A A A A AGTTAAAAAA 
AAACTTACTT TGTGGGTTGC AATACAAACT GCTCTTGACA ATGACTATTC CCTGACAGTT 



6 0 
120 

180 

24 0 

3 0 0 
360 

4 20 
4 30 
540 
6 0 0 

6 60 

7 20 
7 80 
840 
9 0 0 
9 60 

10 2 0 
1080 
114 0 
12 0 0 
12 6 0 
1320 
1380 

14 4 0 

15 0 0 
1560 
16 2 0 
] 680 
1740 
1 800 
1860 
19 2 0 

1 980 
2040 
2100 
2160 
2220 
2280 

2 3 40 
2 4 00 

24 60 

25 20 
2580 

26 4 0 
2700 
2760 
2820 
J880 
2940 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/822,29 5 

input" Set : N:\Crf3\RULE60\09822295.raw 
Output Set: N:\CRF3\01252002\l822295.raw 



DAT K 0 1/25/2002 
TIME I 6:51.27 



ATTTTTGCCT AAATGGAGTA TACCTTG i'AA ATCTTCCCAA ATGTTGTGGA -ACTGGAAT 3000 
V>5 ATTAAGAAAA TGAGAAATTA TATTTATTAG AATAAAA i G l ^aM,„n,, 3r , 0 

?, ~~» SSS ™= iSSSS SSSS j 

i ™ S SSSSS 55SSS =" s= »™ 

129 I^™g^ mtaaattgg CAGGTAATTG TTTTTACAAA GAATCCACCT 3360 

is £s ™- « = -™ slso 

ilJ ESS SSI SS TACAATGTAT CCAACACACA 3.0 

134 CTCAATAAAC TTTTTGGTTG TTAAAAAAAA AAAAAAAAAA 
H8 ^ INFORMATION FOR SEQ ID NO: 2: 
140 (i) SEQUENCE CHARACTERISTICS: 

142 (A) LENGTH: 807 amino a< 

(B) TYPE: amino acid 

144 (C) STRANDEDNESS: single 

145 (D) TOPOLOGY: linear 
147 (ii) MOLECULE TYPE: peptide 
149 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

151 Met Asp Gin Arg Glu He Leu Gin Lys Phe Leu As P Glu Ala Gin Ser 

III Lys Lys lie Thr L^s Glu Glu Phe Ala Asn Glu Phe Lou Lys Leu Lys 

\ll Arg Gin Ser Tnr Lys Tyr Lys Ala Asp Lys Thr Tyr Pro Thr Thr Val 

- c 4 0 43 

159 TlQ T „ Q t t r o A^a Tyr Lvs As d He Leu 

161 Ala Glu Lys Pro Lys Asn ire L/S As. A.g , 

50 _ 55 60 

164 



165 65 



162 Pro Tyr Asp Tyr Ser Arg Val Glu Leu Ser Leu lie Thr Ser Asp Glu 

' cr 70 

Asp Ser Ser Tyr lie Asn Ala Asn Phe lie Lys Gly Val Tyr Gly Pro 
85 90 

170 Lys Ala Tyr lie Ala Thr Gin Gly Pro Leu Ser Thr Thr Leu Leu Asp 

171 100 5 

173 Phe Trp Arg Met He Trp Glu Tyr Ser Val Leu lie lie Val Met Ala 
171 115 120 12b 

176 r ya Met Glu Tyr Glu Met Gly Lys Lys Lys Cys Glu Arg Tyr Trp Ala 

177 1?0 135 140 

\]l Glu III Gly Glu Met Gin Leu Glu Phe Gly Pro Phe Ser Val Ser Cys 
]ll Glu Ala Glu Lys Arg Lys Ser Asp Tyr lie lie Arg Thr Leu Lys Val 
1b5 Lys Phe Asn Ser Glu Thr Arg Thr Tie Tyr Gin Phe His Tyr Lys Asn 
18a Trp Pro Asp His As P Val Pro Ser ITr He Asp Pro lie Leu Glu Leu 



in* 195 200 205 

191 He Trp Asp Val Arg Cys Tyr Gin Glu Asp Asp Ser Val Pro lie Cys 

]ll lie His Cys Ser Ala Gly lH Gly Arg Thr Gly Val lie Cys Ala lie 

1,5 225 230 235 
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DATE: 01/2 5/2002 
RAW SEQUENCE LISTING „, 0 _- 00 = r ] MK ' 10:51:27 

PATKNT APPLICATION: US/09/822,295 IIMt ' 

T riDUt Set : N:\Crf3\RULE60\09822295.raw 
Output Set: N:\CRF3\01252002\l822295.raw 

_ . .... ^ t T~o M-t Leu Leu Lys Asp Gly He He Pro Gl u Asn Phe 
ly/ Asp i?i •■>»■ "t- '-- 250 z J ' 

1<,d no Arc Glu Met Arg Thr Gin Arg Pro Ser Leu 

200 Ser Val Phe Ser Leu lie Arg Glu Met a y ^ 

Mil 260 . A i a y a ]^ Leu Glu 

val Gin Thr Gin Glu Gin Tyr Glu Leu v.l Tyr 285 

!H 275 , .... ,. e »rq Asp Lvs His Ser Gly Thr 

j„6 Leu Phe Lys Arg Gin Met asp -1 -f ^ 

•U" 290 ■ I „. pro Glu Lys Asn Bis Thr Leu Gin 

no Glu Ser Gin Ala Lys Bis Cys lie Pro Glu Lys ^ 

^ fla Asp ser Tyr ser fro Asn Leu Pro Lys ser Thr Thr Lys Al, Ala 

A Sn III «. ™ ^ t «i. ^ ^ f° s « ~ 

I.! 7 , ser Phe Asp III Ar, Thr Ser Glu £ ser Al, Lys Glu Glu Leu v.l 

'A Leu His III Ala Lys Ser Ser Thr ser Phe Asp Phe Leu Glu Leu Asn 

SJ Tyr III Phe Asp Lys Asn ill « Thr Thr Met Lys Trp Gin Thr Lys 

ifs Ala Phe Pro ,1, val Gly Giu Pro Leu Gin Lys His Gin Ser Leu Asp 

?,? Leu Gly ser Leu Leu Phe Glu Gly Cys Ser Asn ser Lys Pre v.l Asn 

,H2 420 „ „ v ,i p r o lie Thr Arg Thr Lys 

, M Ma Ala Gly Arg Tyr Phe Asn ser \al Pro ^ . 

SS ser Thr Pro Phe Glu Leu Tie "„ Gin «, Glu Thr Lys Glu val Asp 

^8 450 ^ 5 Gl Pro His Asp Ser Cys 

no ser Lys Glu Asn Phe ser Tyr Leu Glu ser Gin ^ 

l\\ III v.l Glu »et Gin Ala Gin Lys Val Met His v.l ser ser Ala Glu 

l\l Leu Asn Tyr Ser Leu Pro Tyr Asp ser Lys His Gin He Arg Asn Al. 

% ser Asn Val Lys His His Asp ser Ser Al. Leu Gly V.l Tyr Ser Tyr 

% U e Pro Leu v.l Glu Asn Pro Tyr Phe Ser Ser Trp Pro Pro Ser Gly 

Thr III ser Lys Met Ser III Asp Leu Pro Glu Lys Gl„ Asp Gly Thr 

•:r. v" Phe Pro ser Ser Leu Leu Pro Thr ser Ser Thr ser Leu Phe Ser 

Tyr Tyr Asn ser „is\sP ser Leu Ser Leu Asn ser Pro Thr^Asn He 

J6i ?° ,=„ Gin Glu Ser Ala v.l Leu Al. Thr Ala Pro Arg 

,66 ser ser Leu Leu Asn Gin Glu ser a ^ 

; " • - A-„ Glu He Pro Pro Pro Leu Pro v.l Ar, Thr Pro Glu ser 

269 He Asp Asp Glu lie fiu fi 62Q 

270 610 6 j b h a pr p ro Asn Val Pro Lys 
272 Phe He Val Val Glu Glu Ala Gly Glu Phe Ser Pro 
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27 3 
275 
276 
'119 



DATE: 0 1/25/2002 
RAW SEQUENCE LISTING f 1 M E : 16:51:27 

PATENT APPLICATION: US/09/822 , 29 S 

input SPt : N:\Crf3\RULE60\09822295 raw 
Output Set: K:\CRE3\01252002\I822295.raw 

635 640 
' r v _i T [i e Glv Tnr bef i-eu i - r 

Ser Leu Ser Ser Ala Val Lys \al L,o lie . ^ 

.,„ ^C.se,"^,-*-'^.- - — 
?,? P ro Ser W . «r v., W . «u »p ser Pro W . ser 01. - 0 1. 

-;s „ P til - - - »° "» pr ° Glu ;s Thr Leu Glu s " 

™ Phe Z «. »i. »p - "p cy- £ G1 ° ser ue Glu 

= 88 705 _ _ ™ Thr Met Glu is n ser Thr ser ser Lys 

290 Tyr Ser mr at-r ± j ■• - _, 30 / 3j 

"oi 725 . _ ^v,^ a™ q^r t.vs Ser Leu 

* u r , t--q Thr Pro Glv Lvs ser t-ne — 

-)93 Gin Thr Leu LyS Ttir fit- ^ . - 7d0 

^ 740 M t Tv . Lvs S er lie Cys Asn Ser Cys Pro Pro 

. 96 L ys lie Leu Arg Asn Met Lys Lys ser ^ 

297 I" Ma Glu Ser Val Gin Ser Asn Asn Ser Ser Ser Phe Leu 

299 Asn Lys Pro Ala Glu Ser va 7gQ 

300 770 " L Pro Lys Gly Pro Arg Asn 
30' Asn Phe Gly Phe Ala Asn Arg Phe Ser Lys P 8Q0 

303 785 I tip 

305 Pro Pro Pro Thr Trp Asn lie 

306 805 

314 (2) INFORMATION FOR SEQ ID NO: 3: 

lf M) SEQUENCE CHARACTERISES: 

3^3 ( A) LENGTH : 23 base pairs 

(B ) TYPE : nucleic acid 
it ( C) STRANDEDNESS : single 

3^ ( D) TOPOLOGY: linear 

3^3 (ix) FEATURE: er The Ietter ^ stands for 

326 The letter "V" stands for A, C or 

328 The letter "R" stands for A or G. 

329 The letter "N" stands for A, C 

3r. ^ T (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 23 
3 34 GAYTTYTGGV RNATGRTNTG GGA 

,2) INFORMATION FOR SEQ ID NO^ 4: 
""" ' (i) SEQUENCE CHARACTERISTICS: 



{±) A.") ^LENGTH : 23 base pairs 

.3 (B) T ype: nucleic acid 

:^ 41 ( c) STRANDEDNESS: single 

;./ c rtn TOPOLOGY: linear 

^OTHER -FORMATION. The letter stands for C or G. 

G 



350 The letter " Y" stands for C or T 
,51 The letter "N" stands for A, C, G 
?52 or T . 
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DATF. : 01/2 5/2 00 2 
VERIFICATION SUMMARY ' . 

PATENT APPLICATION: US/09/822, 295 1 IML : 16 . .> 1. . <■« 

input Set : N:\Crf3\RULE60\09822295.raw 
Output Set: N:\CRF3\01252002\I822295.raw 

, „ ...o.n K.vword .isspelled or invalid format, [(A) APPLICATION NUMBER : ] 

^38 M-:220 C: Keyword .isspelled or invalid f ormat , , ( H , * > 

r-^Rfi M-341 W (46) "n" or "Xaa" used, for SEQ ID* . 5 
^lOM^lSi (46! "n- or "Xaa" used, for SEQ ID* : 6 
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